# Real-time Caption Generation
Tiny Image Captioning
Apache-2.0
A lightweight image captioning model based on bert-tiny and vit-small, weighing only 100MB, with extremely fast performance on CPU.
Image-to-Text
Transformers English

T
cnmoro
4,298
2
Llava NeXT Video 34B DPO
Llama 2 is a series of open-source large language models developed by Meta, supporting various natural language processing tasks.
Video-to-Text
Transformers

L
lmms-lab
214
10
Featured Recommended AI Models